(54) DIGEST PRODUCTION SYSTEM

(57)Abstract: 

PROBLEM TO BE SOLVED: To efficiently produce a digest image from video information,
such as a video obtained by recording a program, based on audio information that is
synchronized with the video information or on a telop video in the video.

SOLUTION: A telop extracting part 104 extracts a telop video from a program video fetched
by a video fetching part 101, while an audio information extracting part 105 extracts a
characteristic image or a characteristic scene based on, e.g., a volume value of the audio
information that is added in synchronization with the program video. A combination
processing part 106 extracts a video part that newly becomes characteristic through the
stepwise combination of the characteristic parts which correspond to the extracted telop
video and audio information, and a digest producing part 108 produces a digest video based
on the extracted video.
JP 2000-023062,A
* NOTICES *

JPO and NCIPI are not responsible for any
damages caused by the use of this translation.

1. This document has been translated by computer, so the translation may not reflect the
original precisely.

2. **** shows a word which cannot be translated.
3. In the drawings, no words are translated.
[Claim(s)] 

[Claim 1] A digest creation system characterized by comprising: a sound information
extract means for extracting a characteristic image or a characteristic scene of image
information based on sound information added in synchronization with the image
information; and a digest creation means for creating a digest image based on the
characteristic image or the characteristic scene extracted by said sound information
extract means.
[Claim 2] A digest creation system characterized by comprising: a telop extract means for
extracting a telop image including text from image information; and a digest creation
means for creating a digest image based on the telop image extracted by said telop
extract means.
[Claim 3] A digest creation system characterized by comprising: a sound information
extract means for extracting a characteristic image or a characteristic scene of image
information based on sound information added in synchronization with the image
information; a telop extract means for extracting a telop image including text from said
image information; a combination processing means for extracting a new characteristic
image by combining the characteristic image or the characteristic scene extracted by said
sound information extract means with the telop image extracted by said telop extract
means; and a digest creation means for creating a digest image based on the characteristic
image extracted by said combination processing means.

[Claim 4] The digest creation system according to claim 1 or 3, characterized in that said
sound information extract means detects a state in which the sound-volume value of the
sound information added in synchronization with said image information remains
continuously at or above a predetermined set value for a time equal to or longer than a
predetermined set time, and extracts the characteristic image of said image information at
the corresponding point in time, or the characteristic scene of said image information in
the corresponding time zone.
[Claim 5] The digest creation system according to claim 2 or 3, characterized in that said
telop extract means extracts said telop image by detecting, for a region on the image
screen that is specific to the type of the image concerned or to the type of said telop
image, the change on the time axis of the image data in that region.

[Claim 6] The digest creation system according to claim 5, characterized in that said
image information subject to telop image extraction by said telop extract means is a
program image, and said telop extract means has: a news-flash telop extract means for
extracting a telop image for news flash by checking the change of brightness information
on the time axis for a predetermined region in the upper or lower part of the program
image screen; and an emphasis telop extract means for extracting an emphasis telop image
by checking the state of change of color information on the time axis for the whole
program image screen and detecting a state in which a color remaining from those
extracted in the past on the time axis accounts for at least a set ratio.

[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] This invention relates to a digest creation approach for creating
a digest image from image information, and in particular to a digest creation approach
suitable for use in an automatic video recording system that records programs
automatically.
[0002] 

[Description of the Prior Art] In recent years, some CS broadcasting has started in
addition to terrestrial and BS broadcasting, and many more programs have become available
for viewing. The number of channels is expected to increase further with the digitization
of broadcasting. As a result, an environment is taking shape in which users can choose
programs that suit their tastes and preferences from an ever more diverse range of
channels. At the same time as this multiplication of channels, the technical environment
for higher quality, including HD-TV, is also being improved.
[0003] 

[Problem(s) to be Solved by the Invention] However, as the number of channels increases,
it is conversely becoming difficult to cover all of them and to grasp what kinds of
programs are being broadcast and whether there is any program the user wants to watch.
For this reason, there was a problem that a user could overlook even programs of interest
without noticing that they were being broadcast.
[0004] Similarly, when recording broadcast programs, the user has to choose the programs
to watch from among many programs, so there was a problem that oversights, failures to
record, and the like increased.
[0005] Thus, toward the multichannel age there are various problems in selecting programs
to record in line with individual tastes and preferences. Furthermore, even for video that
has already been recorded, finding the program the user wants to watch conventionally
required either spending as much time as the recording itself or watching from the
beginning while fast-forwarding manually, which took time. When the program turned out not
to be one the user wanted to watch, a great deal of time was wasted.

[0006] This invention was made in consideration of the above situation, and its purpose is
to offer a digest creation system that can efficiently create a digest image from image
information, such as recorded program video, based on the sound information synchronized
with the image information concerned or on a telop image in the image concerned.
[0007] 

[Means for Solving the Problem] The digest creation system of this invention is
characterized by having a sound information extract means for extracting a characteristic
image or a characteristic scene of image information based on sound information added in
synchronization with the image information, and a digest creation means for creating a
digest image based on the characteristic image or the characteristic scene extracted by
this sound information extract means. In such a configuration, since the characteristic
image or the characteristic scene of the image information is extracted based on the sound
information added in synchronization with the image information, a digest image containing
the characteristic parts related to the sound information can easily be created by
performing digest creation from the image information using this characteristic image or
characteristic scene.
[0008] Here, to extract the characteristic image or the characteristic scene, the sound
information extract means need only use the sound-volume value of the sound information
added in synchronization with the above-mentioned image information and detect a state in
which the time (time zone) during which the sound-volume value is continuously at or above
a set value is at least a set time. For example, the image at the start of that state may
be extracted as a characteristic image, or the sequence of images within that state (time
zone) may be extracted as a characteristic scene.
[0009] Moreover, besides the sound-volume value, it is also possible to classify the sound
information by type, for example into voice, sound effects, and music, and to extract as a
characteristic image the image part corresponding to a characteristic sound that the user
specifies or that matches the user's tastes and preferences. For voice in particular,
attributes such as adult, child, male, and female are identified. For sound effects and
music, corresponding sound patterns are prepared beforehand, so that the image part
corresponding to a characteristic sound specified by the user or matching the user's
tastes and preferences (for example, for a user whose hobby is driving, the image scene
corresponding to the running sound of a car) is extracted at a still finer level. By
splicing these parts together on the time axis, they can be edited into a digest image
that matches the user's tastes and preferences.

[0010] Moreover, this invention is characterized by having a telop extract means for
extracting a telop image including text from image information, and a digest creation
means for creating a digest image based on the telop image extracted by this telop extract
means.
[0011] In such a configuration, since a telop image including text is extracted from the
image information, a digest image containing the characteristic parts related to telops
can easily be created by performing digest image creation using this telop image.
[0012] Here, considering that the region on the image screen where a telop is displayed is
specific to the type of image or the type of telop, efficient telop image extraction can
be realized by extracting the telop image through detection of the change on the time axis
of the image data in that region.
[0013] When the above-mentioned image information is a program image, two kinds of telop
need to be distinguished: the telop for news flash used in news programs and the like, and
the emphasis telop used in variety programs and the like. The two have different
characteristics: the telop for news flash is displayed in a predetermined region in the
upper or lower part of the screen, whereas the emphasis telop is displayed in large
characters over the whole screen, in a particularly conspicuous color. Accordingly, for
the telop for news flash, the change of brightness information on the time axis is checked
for the predetermined region in the upper or lower part of the program image screen, and a
point where the change is large can be detected as the start or end point of the telop for
news flash. For the emphasis telop, on the other hand, the state of change of color
information on the time axis is checked for the whole program image screen, and the start
or end point of the emphasis telop can be detected by detecting a state in which a color
remaining from those extracted in the past on the time axis accounts for at least a set
ratio.

[0014] Moreover, this invention is characterized by providing a combination processing
means for extracting a new characteristic image by combining the characteristic image or
the characteristic scene extracted by the above-mentioned sound information extract means
with the telop image extracted by the above-mentioned telop extract means, the
above-mentioned digest creation means creating a digest image based on the characteristic
image extracted by this combination processing means.

[0015] In such a configuration, a digest image is created by combining two kinds of
characteristics, which makes it possible to create a high-quality digest image. In
particular, by using the two characteristics in stages, the time range of a characteristic
scene can be determined from the characteristic image information given by, for example,
the telop display, and a higher-quality digest image can be created by extracting, within
that time range, the image scenes in which the sound-volume value is at or above the set
value.

[0016] In addition, a digest image more useful to the user can be created by having the
above-mentioned digest creation means rearrange on the time axis the image material
extracted for digest creation.

[0017] Moreover, when handling digitized video, the image information corresponding to the
extracted characteristics is acquired as still images, and by grouping these according to
each target characteristic condition, a digest image with a wider range of application can
be created.
[0018] 

[Embodiment of the Invention] Hereafter, an embodiment of this invention is explained with
reference to the drawings.

[0019] Drawing 1 is a block diagram of an information system equipped with the digest
creation system according to one embodiment of this invention.
[0020] The digest creation system 10 performs digest creation processing on the TV program
video (video image) that the TV program reservation system 20 has recorded on a VTR tape
serving as a magnetic tape medium, and outputs the result to the TV program reservation
system 20 as a digest image.
[0021] The TV program reservation system 20 operates in cooperation with the digest
creation system 10. Taking as input the actual TV video (terrestrial broadcasting,
satellite broadcasting, etc.) and the TV program information (TV program listings), the TV
program reservation system 20 records program video in accordance with the TV programs
that the user has set.

[0022] Using the two above-mentioned systems 10 and 20 in combination makes an efficient
TV program reservation system possible.

[0023] For example, when a user watches a large quantity of recorded video, it is
inefficient and wasteful to spend the same amount of time watching as was spent recording.
Toward the multichannel age of digitization, the need to create digest images from
recorded video is therefore expected to grow still stronger. In such times, the
significance of creating a digest image lies not only in shortening viewing time but also
in providing a digest image that conveys the contents of the program video well enough to
be understood and that suits so-called personal information, including the individual's
tastes and preferences.

[0024] Drawing 2 is a block diagram showing the configuration of the digest creation
system 10.

[0025] In drawing 2, the image incorporation section 101 takes in image information from
ordinary TV video, ********, cable television, and the like (recorded by the TV program
reservation system 20 in drawing 1), and stores the whole program video in the image
accumulation section 103 of the image information processing section 102. The image
information processing section 102 performs overall control and management for creating a
digest image from the image information accumulated in the image accumulation section 103.
[0026] The telop extract section 104 is one functional element for digest image creation.
It extracts from the program video either the still image (frame image) corresponding to
the point in the program video (the elapsed time counted from the beginning of the program
video) at which a telop was displayed, or a continuous moving image made up of two or more
frames.
[0027] The sound information extract section 105 is, like the telop extract section 104,
one functional element for digest image creation. It extracts the characteristic parts
that summarize the program video, using the sound information that accompanies the program
video in synchronization with it.

[0028] The combination processing section 106 uses the telop extract section 104 and the
sound information extract section 105 in combination, in stages, and extracts as the
digest image a new characteristic part obtained from the characteristic part of the
program video taken out by telop extraction and the characteristic part of the program
video taken out based on sound information.

[0029] The extract information control section (information switching section) 107
switches as appropriate among the telop extract section 104, the sound information extract
section 105, and the combination processing section 106, and supplies their output to the
digest creation section 108 for digest image creation.

[0030] Using personal information, including the individual's tastes and preferences,
obtained by the personal information processing section 109, the digest creation section
108 creates the final digest image based on the output of the extract information control
section 107, and transfers it to the image information processing section 102 for
accumulation.

[0031] The digest information output section 110 outputs the digest image accumulated in
the image accumulation section 103 to another system (here, the TV program reservation
system 20 in drawing 1) for display.

[0032] The user interface section 111 provides the interface with the user. It transfers
the personal information input by the user to the personal information processing section
109, and transfers control information in accordance with the user's instructions to the
image information processing section 102.
[0033] Next, the operation of the digest creation system 10 configured as described above
is explained.

[0034] First, feature extraction (extraction of characteristic points or characteristic
scenes) based on sound information (specifically, the sound-volume value) by the sound
information extract section 105 in the digest creation system 10 is explained with
reference to the operation explanatory view of drawing 3 and the flow chart of drawing 4.

[0035] Here, it is assumed that the program video of a grand sumo tournament, recorded for
2 hours on the recording tape (VTR tape) TP by the TV program reservation system 20 in
drawing 1, is taken in by the image incorporation section 101 together with the sound
information synchronized with the program video, and is accumulated in the image
accumulation section 103. The time range R1 on the recording tape TP shown in drawing 3
represents one bout.
[0036] A graph tracing the sound-volume value within R1 is shown in region R2. In this
bout, the frame images at the beginning of the program video where the sound-volume value
(sound power level) reaches or exceeds a predefined constant value Th1 (that is, where Th1
is exceeded) are extracted as images D1, D2, and D3, respectively.

[0037] If this is taken to indicate that the spectators were excited during the bout, that
is, that there were three scenes in which loud cheering continued, it also becomes
possible to extract the program video corresponding to the time zones in which the sound
volume is continuously at or above the constant value Th1. When sound information is used,
extracting characteristic scenes that continue over time in this way can be the more
effective form of feature extraction.
[0038] Accordingly, the sound information extract section 105 in this embodiment checks
the sound-volume value of the sound information synchronized with the program video from
the head of the recording tape TP (step S1). When it detects that the sound-volume value
has become Th1 or more, it examines whether that state continues from that point for a
predefined fixed time Th2 or more, that is, whether the period (time zone) during which
the sound-volume value is continuously Th1 or more is Th2 or more (step S3). The sound
information extract section 105 performs the following image extraction only when the
state in which the sound-volume value is Th1 or more continues for the time Th2 or more.
[0039] The sound information extract section 105 first judges whether to extract a still
picture or a continuous picture, according to the user's specification input to the image
information processing section 102 through the user interface section 111 (step S4). If a
still picture is specified, the sound information extract section 105 extracts, as a
characteristic point based on the sound-volume value, the frame image at the head of the
program video where the sound-volume value became Th1 or more (step S5). If, on the other
hand, a continuous picture is specified, the sound information extract section 105
extracts, as a characteristic scene based on the sound-volume value, the program video of
the time zone in which the sound-volume value is Th1 or more (step S6).

[0040] The sound information extract section 105 repeats the above operation up to the end
position of the program video recorded on the recording tape TP (step S7). That is, in the
example of drawing 3, characteristic points or characteristic scenes based on the
sound-volume value are extracted over the whole 2 hours of the recording tape TP by
processing the time zone R1 of each bout on the tape in this way.
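
As an illustration of steps S1 through S7, the following is a minimal sketch, not the
implementation of the embodiment. It assumes the sound-volume values are available as a
list with one value per frame, and the names extract_loud_scenes, characteristic_points,
th1, and th2 are hypothetical: th1 stands for the sound-volume set value and th2 for the
set time, expressed as a number of frames.

```python
from typing import List, Tuple


def extract_loud_scenes(volume: List[float], th1: float, th2: int) -> List[Tuple[int, int]]:
    """Return (start, end) frame ranges, end exclusive, in which the volume
    stays at or above th1 for at least th2 consecutive frames."""
    scenes = []
    start = None  # first frame of the current loud run, or None if not in a run
    for i, v in enumerate(volume):
        if v >= th1:
            if start is None:
                start = i  # volume has just reached th1
        else:
            # The loud run (if any) ended at frame i; keep it only if long enough.
            if start is not None and i - start >= th2:
                scenes.append((start, i))
            start = None
    # Handle a loud run that lasts until the end of the recording.
    if start is not None and len(volume) - start >= th2:
        scenes.append((start, len(volume)))
    return scenes


def characteristic_points(scenes: List[Tuple[int, int]]) -> List[int]:
    """Still-picture mode: the first frame of each qualifying run."""
    return [start for start, _ in scenes]
```

In continuous-picture mode the ranges returned by extract_loud_scenes would be used
directly as characteristic scenes; in still-picture mode only their starting frames would
be kept.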
[0041] In addition, on the operation screen in drawing 3, the monitor M1 is used to
display the recording tape TP as it is, and the monitor M2 is used to display the
extracted characteristic scenes (here, the digest image based on the sound-volume
characteristics extracted by the sound information extract section 105).

[0042] Drawing 5 explains how the telop extract section 104 extracts image scenes showing
telops from the same video, i.e., the grand sumo program explained with drawing 3,
recorded for 2 hours on the recording tape TP.

[0043] In drawing 5, the time range R1 of the recording tape TP represents one bout. In
the example of drawing 5, three scenes, i.e., images D11, D12, and D13, are extracted as
telop images within R1. Examples of these images (telop images) D11, D12, and D13 are
shown in drawing 6 (a), (b), and (c). The image D11 of drawing 6 (a) is the telop image of
the opposing wrestlers' names displayed after the wrestlers are called; the image D12 of
drawing 6 (b) is the telop image of the table of past head-to-head records; and the image
D13 of drawing 6 (c) is the telop image of the result of the bout, showing the winning
technique when victory is declared. For the program video of a grand sumo tournament,
since the pattern of displayed telops is fixed, R1, the time range of one bout on the
recording tape TP, can also be extracted.

[0044] The extraction of a telop image (screen) by the telop extract section 104 is
basically performed as follows. First, over the whole image screen, the number of pixels
for which the difference between temporally adjacent frame images is at or above a
constant value is counted, and an image for which this count is at or above a fixed number
is detected. Alternatively, the difference between temporally adjacent frame images may be
binarized with a constant value over the whole image screen, and an image in which the
frequency of occurrence of edge data is at or above a certain constant value may be
detected.
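
The basic pixel-counting check can be sketched as follows. This is a minimal
illustration under stated assumptions, not the embodiment's implementation: frames are
assumed to be lists of rows of brightness values, and the names changed_pixel_count,
detect_change_frames, pixel_threshold, and count_threshold are hypothetical. The
edge-frequency variant is not shown.

```python
from typing import List

Frame = List[List[int]]  # assumed layout: rows of brightness values


def changed_pixel_count(prev: Frame, curr: Frame, pixel_threshold: int) -> int:
    """Count pixels whose absolute brightness difference between two
    frames is at or above pixel_threshold."""
    return sum(
        1
        for row_prev, row_curr in zip(prev, curr)
        for p, c in zip(row_prev, row_curr)
        if abs(c - p) >= pixel_threshold
    )


def detect_change_frames(frames: List[Frame], pixel_threshold: int,
                         count_threshold: int) -> List[int]:
    """Return indices of frames that differ from the immediately preceding
    frame in at least count_threshold pixels."""
    return [
        i
        for i in range(1, len(frames))
        if changed_pixel_count(frames[i - 1], frames[i], pixel_threshold) >= count_threshold
    ]
```

Because each frame is compared only with its immediate predecessor, a check of this kind
fires when a telop appears and again when it disappears, but not while it stays on screen.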

[0045] In examples such as the grand sumo tournament above, the region of the screen where
a telop is displayed is fixed, as in drawing 6. In such a case, extraction precision can
be raised markedly by setting a mask region beforehand and checking only that region. The
region of the screen where a telop is displayed is in many cases fixed for each program.
It is therefore also possible to treat it as one item of the user's taste and preference
information, that is, to set a specific mask region for each of the user's favorite
programs.

[0046] Moreover, a still more efficient digest program video can be created by having the
combination processing section 106 combine the extraction of characteristic scenes based
on the sound-volume value by the sound information extract section 105 with the extraction
of characteristic images based on telop extraction by the telop extract section 104. That
is, in the grand sumo example above, the time range of each bout can be determined from
the characteristic image information given by the telop display, and a higher-quality
digest image can be created by extracting, within that time range, the image scenes in
which the sound-volume value is at or above the set value Th1.
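
The staged combination can be pictured as an interval intersection. The sketch below is
only an illustration under assumptions: it supposes that the bout time ranges have already
been derived from the telop display and that the loud scenes have already been found from
the sound-volume value, both as (start, end) pairs with the end exclusive. The name
scenes_within is hypothetical.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # (start, end), end exclusive


def scenes_within(ranges: List[Interval], scenes: List[Interval]) -> List[Interval]:
    """Clip each loud scene to the telop-derived time ranges and keep
    only the parts that fall inside a range."""
    result = []
    for range_start, range_end in ranges:
        for scene_start, scene_end in scenes:
            start = max(range_start, scene_start)
            end = min(range_end, scene_end)
            if start < end:  # the scene overlaps this range
                result.append((start, end))
    return result
```

Loud passages that fall outside every telop-delimited range, such as cheering between
bouts, are dropped, which is one way to read the quality gain described above.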

[0047] Here, the details of the operation by which the telop extract section 104 extracts
telop images when creating a digest image are explained separately for two cases:
extracting telop images for news flash, as in news programs, and extracting the emphasis
telop images used heavily in variety programs and the like.

[0048] Drawing 7 (a) shows the mask regions set in order to extract telop images for news
flash, as in news. Here, the horizontally long rectangular region 31 at the top of the
screen or the horizontally long rectangular region 32 at the bottom of the screen is set
as the detection region (mask region) on the television screen 30, and the telop image for
news flash is extracted by checking the temporal change of the image in the region 31 or
32.
[0049] Hereafter, the extraction of the telop image for news flash by the telop extract
section 104 is explained with reference to the flow chart of drawing 8.
[0050] First, the telop extract section 104 takes out, one by one from the head of the
program video recorded on the recording tape TP, image frames that precede and follow each
other on the time axis, and obtains multi-valued difference information (a subtraction
image) consisting of the differences between pixels (for example, brightness data) at the
same positions in the mask region (31 or 32) of the two frame images (steps S11-S13).

[0051] Next, for the multi-valued difference information (subtraction image) on the mask
region (31 or 32), the telop extract section 104 counts the number of pixels at or above a
certain set threshold (step S14). When the count is at or above a fixed number determined
by the total number of pixels in the mask region (or at or above a fixed ratio of the
total number of pixels in the mask region), it judges that a telop image for news flash,
such as in news, has been extracted (step S15).
[0052] With this decision technique alone, extraction occurs only at the moment a telop
image is displayed and the moment the telop display ends. This embodiment therefore
applies the following technique.

[0053] That is, when the telop extract section 104 detects the moment a telop image is
displayed, it holds the preceding frame image, i.e., the frame image immediately before
the telop was displayed (steps S16 and S17), turns on a flag indicating that the start of
a telop display has been detected (the telop display start flag) (step S18), and extracts
the present frame image taken out at step S12 (here, the first telop image) as a telop
image (step S19).

[0054] Thereafter, the telop extract section 104 takes out the subsequent frame image
(step S12) and obtains its difference from the frame image held at step S17, i.e., the
frame image immediately before the telop was displayed (step S13). Because the difference
is taken against the frame image immediately before the telop was displayed, as long as
the telop display continues, the number of pixels in the difference information
(subtraction image) at or above the set threshold remains a large value at or above the
fixed number (steps S14 and S15). Therefore, by extracting the present frame image as a
telop image whenever this state holds and the telop display start flag is on (steps S16
and S19), all the frame images in the period during which the telop is displayed can be
extracted as telop images.

[0055] When the number of pixels at or above the set threshold eventually falls below the
fixed number, the telop extract section 104 turns off the telop display start flag (step
S21), holds the present frame image (step S22), then takes out the following frame image
(step S12) and repeats the processing from step S13 onward. The number of pixels at or
above the set threshold falls below the fixed number because the screen has returned to
the usual display, so the number of pixels in the difference information at or above the
set threshold decreases.
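
The held-reference technique of drawing 8 can be sketched as a small state machine. This
is an illustrative reading of the flow described above, not the embodiment's
implementation. Frames are assumed to be lists of rows of brightness values, the mask
region is assumed to be given as (top, bottom, left, right) with bottom and right
exclusive, and the names extract_telop_frames, pixel_threshold, and count_threshold are
hypothetical.

```python
from typing import List, Tuple

Frame = List[List[int]]            # assumed layout: rows of brightness values
Mask = Tuple[int, int, int, int]   # assumed: (top, bottom, left, right), exclusive ends


def extract_telop_frames(frames: List[Frame], mask: Mask,
                         pixel_threshold: int, count_threshold: int) -> List[int]:
    """Return the indices of all frames in which a telop is judged to be
    displayed inside the mask region."""
    if not frames:
        return []
    top, bottom, left, right = mask

    def count_changed(reference: Frame, current: Frame) -> int:
        # Count pixels in the mask whose difference from the held reference
        # frame is at or above pixel_threshold.
        return sum(
            1
            for y in range(top, bottom)
            for x in range(left, right)
            if abs(current[y][x] - reference[y][x]) >= pixel_threshold
        )

    extracted = []
    telop_on = False          # the telop display start flag
    reference = frames[0]     # held frame: the last frame without a telop
    for i in range(1, len(frames)):
        if count_changed(reference, frames[i]) >= count_threshold:
            # Telop present: keep the held reference frame unchanged so that
            # every frame of the display period keeps exceeding the count.
            telop_on = True
            extracted.append(i)
        else:
            # Usual display: turn the flag off and hold the present frame.
            telop_on = False
            reference = frames[i]
    return extracted
```

Keeping the reference frame fixed while the flag is on is what lets every frame of the
display period be extracted, rather than only the frames at which the telop appears and
disappears.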
[0056] Drawing 7 (b) shows the mask region set in order to extract the emphasis telop
images heavily used in variety programs and the like. Here, the rectangular region 33,
covering almost the whole screen, is set as the detection region (mask region) on the
television screen 30, and the emphasis telop image is extracted by checking the temporal
change of the image in the region 33. The characteristic of an emphasis telop image is
that it is displayed in large characters over the whole screen, in a conspicuous color.

[0057] Therefore, as shown in the flow chart of drawing 9, this embodiment performs
difference processing on the time axis for the image in the region 33 in the same way as
in the extraction of the telop image for news flash described above (steps S21, S22-S24),
and additionally performs the following processing.

[0058] First, when the number of pixels at or above the set threshold in the difference
information (subtraction image) on the mask region (33) becomes the fixed number or more,
the preceding frame image, i.e., the frame image immediately before the telop was
displayed, is held and the telop display start flag is turned on (steps S26-S28), just as
in the extraction of the telop image for news flash. What differs from the extraction of
the telop image for news flash is that, during the period in which the number of pixels at
or above the set threshold in the difference information (subtraction image) remains at or
above the fixed number, each present frame image is extracted as an emphasis telop image
candidate and the frequency information of the colors in that frame image is obtained
(step S29). Here, using the hue value as the color information allows colors to be
extracted stably.

[0059] When the number of pixels exceeding the set threshold stops fulfilling the fixed
number and the screen returns to the usual display, the telop extract section 104 holds
the current frame image and turns off the telop display start flag (steps S30 and S31).
It then measures the color frequency of each emphasis telop image candidate extracted
up to that point and looks for a color that occupies a fixed ratio of the mask field
(check field) 33 (step S32). When such a color exists, the telop extract section 104
judges the corresponding image to be an emphasis telop image and stores it (step S34).
When no such color exists, the telop extract section 104 discards the corresponding
image as not being an emphasis telop image.
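
The emphasis telop procedure of steps S26-S34 can be sketched roughly as follows. This is
only an illustration under stated assumptions, not the implementation of the telop extract
section 104: frames are assumed to be already cropped to the mask field 33 and given as
rows of (R, G, B) tuples, the inter-frame difference is taken on mean brightness, and the
names `pixel_threshold`, `min_changed`, `color_ratio`, the 12 hue bins and the saturation
cutoff are all hypothetical choices.

```python
import colorsys
from typing import List, Tuple

Pixel = Tuple[int, int, int]      # (R, G, B), each 0-255
Frame = List[List[Pixel]]         # rows of pixels inside the mask field (assumed layout)


def changed_pixels(prev: Frame, cur: Frame, pixel_threshold: int) -> int:
    """Count pixels whose mean brightness changed by more than pixel_threshold."""
    count = 0
    for row_p, row_c in zip(prev, cur):
        for p, c in zip(row_p, row_c):
            if abs(sum(c) - sum(p)) // 3 > pixel_threshold:
                count += 1
    return count


def dominant_hue_ratio(frame: Frame, bins: int = 12,
                       min_saturation: float = 0.3) -> float:
    """Share of all mask pixels that fall into the most frequent hue bin."""
    hist = [0] * bins
    total = 0
    for row in frame:
        for r, g, b in row:
            total += 1
            h, s, _v = colorsys.rgb_to_hsv(r / 255, g / 255, b / 255)
            if s >= min_saturation:           # grey pixels carry no usable hue
                hist[min(int(h * bins), bins - 1)] += 1
    return max(hist) / total if total else 0.0


def extract_emphasis_telops(frames: List[Frame], pixel_threshold: int,
                            min_changed: int, color_ratio: float) -> List[int]:
    """Return the indices of frames judged to be emphasis telop images."""
    telops: List[int] = []
    candidates: List[int] = []
    flag_on = False                           # telop display start flag
    for i in range(1, len(frames)):
        n = changed_pixels(frames[i - 1], frames[i], pixel_threshold)
        if n >= min_changed:
            flag_on = True                    # steps S26-S28
            candidates.append(i)              # step S29: keep frame as candidate
        elif flag_on:
            flag_on = False                   # steps S30-S31: back to usual display
            # Steps S32, S34: keep candidates dominated by one color, drop the rest.
            telops += [c for c in candidates
                       if dominant_hue_ratio(frames[c]) >= color_ratio]
            candidates = []
    return telops
```

In this sketch, candidates are judged only once the display returns to normal, as in the
flow chart; a sequence that ends while the flag is still on leaves its candidates unjudged.
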
[0060] Here, a digest image made up only of emphasis telop images may leave the contents
of the original program image unclear. For this reason, the image for a predetermined
length of time following the display of an emphasis telop image is also extracted, which
makes it possible to associate the contents expressed by the emphasis telop with the
image.
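
As a small illustration of extracting a predetermined length of image after each emphasis
telop, the following sketch computes the time spans to keep. The function name, the use
of seconds as the unit, and the merging of overlapping spans are assumptions made for the
example; the text above only states that a predetermined time is extracted.

```python
from typing import List, Tuple


def clips_after_telops(telop_times: List[float], clip_length: float,
                       program_length: float) -> List[Tuple[float, float]]:
    """For each telop time, return the (start, end) span of image kept after it."""
    clips: List[Tuple[float, float]] = []
    for t in sorted(telop_times):
        end = min(t + clip_length, program_length)   # never run past the program
        if clips and t <= clips[-1][1]:
            # Assumption: a telop inside the previous span extends that span.
            clips[-1] = (clips[-1][0], max(clips[-1][1], end))
        else:
            clips.append((t, end))
    return clips
```
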

[0061] Next, with reference to drawing 10, a method is explained in which the digest
creation section 108 rearranges on the time axis the image material extracted for digest
creation, using the user's personal information given through the user interface section
111.

[0062] First, take the cooking program image 40 of drawing 10 as an example of how a
user views a TV program. When the user actually needs to cook something, he or she in
many cases wants to see the image of the dish name or the recipe information first, and
refers to the cooking process as needed.

[0063] However, in a digest image created by the usual technique already described, the
characteristic scenes are listed, as in drawing 10, in the order of the telop image D21
of the "cooking program title", the telop image D22 of the "recipe", and the telop image
D23 of "completion", and the user has to follow the whole time span (here 10 minutes) of
the cooking program image 40 in that order.
[0064] In contrast, the digest creation section 108 rearranges the time sequence of the
extracted characteristic scenes as indicated by arrow 41 in consideration of the user's
personal information (preferences). For example, the "completion" telop image D23, which
displays the dish name, is placed first, followed by the telop image D22 displaying the
"recipe" information, and then the image scenes of the actual cooking process. For the
cooking process scenes, the section of time length L between the "recipe" telop image
D22 and the "completion" telop image D23 may be extracted as it is, or may be sampled in
time.
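
The rearrangement indicated by arrow 41 can be pictured as a sort of the extracted scenes
by a user-specific label order. The sketch below is only an illustration: the `Scene`
type, its labels, the scene times and the `preferred_order` list are hypothetical, and
the rule that unlisted scenes follow in broadcast order is an assumption.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Scene:
    label: str     # e.g. "title", "recipe", "process", "completion" (assumed labels)
    start: float   # position in the original program, in seconds
    end: float


def arrange_digest(scenes: List[Scene], preferred_order: List[str]) -> List[Scene]:
    """Order scenes by the user's preferred labels, then by original time."""
    rank: Dict[str, int] = {label: i for i, label in enumerate(preferred_order)}
    # Labels the user did not list come last, in their broadcast order.
    return sorted(scenes, key=lambda s: (rank.get(s.label, len(rank)), s.start))


program = [
    Scene("title", 0, 10),          # telop image D21
    Scene("recipe", 60, 90),        # telop image D22
    Scene("process", 90, 540),      # section of length L between D22 and D23
    Scene("completion", 540, 600),  # telop image D23
]
digest = arrange_digest(program, ["completion", "recipe", "process"])
print([s.label for s in digest])
```
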
[0065] Although the above has described the digest creation system 10 used in
combination with the TV program reservation system 20, the invention is not restricted
to this.

[0066] For example, as shown in drawing 11, the digest creation system 10 may be built
into a system (content provider side system) 50 that offers contents such as TV program
images. That is, the digest creation system 10 can also be used to create a digest image
of a program image that the user side system 60 has requested from the content provider
side system 50.

[0067] In the configuration of drawing 11, the user side system 60 requests from the
content provider side system 50 the program image that the user wants to see. If the
system 50 were configured to send the entire program image to the user side system 60
from the outset, transmitting such a large volume of image data would take time, and
there would also remain the problem of whether it was really the program image the user
wanted to see.
[0068] In contrast, in the video on-demand system of drawing 11, the digest creation
system 10 built into the content provider side system 50 can create a digest image of
the program image specified by the user in response to the user's request and provide it
to the user side system 60. The user can thereby first grasp the rough contents of the
target program image from the digest image and, if it is a program image the user really
wants to see, need only then request transfer of the original program image.
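
The two-step exchange between the user side system 60 and the content provider side
system 50 can be sketched as follows. The class and method names, the representation of
a program as a list of scene names, and the count of transferred scenes are all
hypothetical; the sketch only illustrates that the digest is sent first and the original
only when asked for.

```python
from typing import Callable, Dict, List, Tuple


class ContentProviderSystem:
    """Stand-in for the content provider side system 50 (hypothetical interface)."""

    def __init__(self, programs: Dict[str, List[str]],
                 make_digest: Callable[[List[str]], List[str]]) -> None:
        self.programs = programs          # program id -> list of scenes
        self.make_digest = make_digest    # plays the role of digest creation system 10

    def request_digest(self, program_id: str) -> List[str]:
        return self.make_digest(self.programs[program_id])

    def request_original(self, program_id: str) -> List[str]:
        return self.programs[program_id]


def view_program(provider: ContentProviderSystem, program_id: str,
                 wants_original: Callable[[List[str]], bool]) -> Tuple[List[str], int]:
    """User side system 60: fetch the digest first, the original only on demand.

    Returns what was finally delivered and how many scenes were transferred in total.
    """
    digest = provider.request_digest(program_id)
    transferred = len(digest)
    if wants_original(digest):
        original = provider.request_original(program_id)
        return original, transferred + len(original)
    return digest, transferred
```

When the user declines after seeing the digest, only the digest has been transferred,
which is the saving described above.
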

[0069] Incidentally, when a digest image is regarded as a summary showing the contents
of a program image, it need not be information that is continuous on a single time axis.
As the digitization of images progresses, summaries of image programs clustered on the
time axis are expected to spread. For example, movie contents for DVD can be extracted
as still images corresponding to the characteristic information and pasted onto a movie
introduction scenario map prepared beforehand, and the result can be used as a summary
showing the contents of the movie. Of course, a user who becomes interested through the
summary can then view the original image.
[0070] Moreover, as applied in the above embodiment, the characteristic point images of
the program image on the time axis are extracted using telop information and sound
information. Besides creating a digest program that is continuous on the time axis,
these point images can be rearranged and edited arbitrarily on the time axis rather than
being edited along the original time axis, which makes it possible to see the result
image immediately when viewing the digest program. An effective example of editing by
such rearrangement is its application to sports program images such as baseball and
soccer, where a viewer may want to see the game result first and then watch the details
at leisure.
[0071] The group of functional elements constituting the digest creation system 10
described above, or the procedures executed by the digest creation system 10, in
particular those of the telop extract section 104 and the sound information extract
section 105, can be realized by loading into a computer a recording medium such as a
CD-ROM on which is recorded a program for causing the computer to function as the set of
functional elements, or a program for causing the computer to execute the procedures,
and having the computer read and execute the program. This program may also be loaded
into the computer through a communication medium such as a communication line.
[0072] 

[Effect of the Invention] As explained in detail above, according to this invention,
image information, in particular a recorded program image, is not handled by simple
information compression on the time axis as in VTR equipment, nor by approaches such as
image recognition that usually take processing time. Instead, the images and scenes
characteristic of the program image are extracted promptly using the telop and the sound
information expressing the attributes of the program image, and are made into a digest
image, so the digest image can be edited flexibly.

[Brief Description of the Drawings] 

[Drawing 1] A block diagram of an information system in which the digest creation system
according to one embodiment of this invention is combined with a TV program reservation
system.

[Drawing 2] A block diagram showing the configuration of the digest creation system 10
in drawing 1.

[Drawing 3] A drawing for explaining the feature extraction based on the sound
information (sound-volume value) by the sound information extract section 105.

[Drawing 4] A flow chart showing the procedure of the feature extraction based on the
sound information (sound-volume value) by the sound information extract section 105.

[Drawing 5] A drawing for explaining the telop image extraction within the program image
by the telop extract section 104.

[Drawing 6] A drawing showing an example of the telop images extracted from the program
image of a long sumo match.

[Drawing 7] A drawing showing the mask fields used for telop image extraction by the
telop extract section 104, for each of the telop image extraction for news flashes and
the emphasis telop image extraction.

[Drawing 8] A drawing showing the procedure of the telop image extraction for news
flashes by the telop extract section 104.

[Drawing 9] A drawing showing the procedure of the emphasis telop image extraction by
the telop extract section 104.

[Drawing 10] A drawing for explaining the operation by which the digest creation section
108 rearranges on the time axis the image material extracted for digest creation.

[Drawing 11] A block diagram showing an example configuration in which the digest
creation system according to one embodiment of this invention is applied to a video
on-demand system.

[Description of Notations]

10 - Digest creation system

20 - TV program reservation system

50 - Content provider side system

102 - Image Research and Data Processing Department

104 - Telop extract section

105 - Sound information extract section

106 - Combination processing section

107 - Extract information control section

108 - Digest creation section

109 - Individual humanity news (personal information) processing section


